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Abstract: 

We consider the problem of obtaining locally D-optimal designs for factorial exper- 
iments with qualitative factors at two levels each with binary response. Our focus 
is primarily on the 2 2 experiment. In this paper, we derive analytic results for 
some special cases and indicate how to handle the general case. The performance 
of the uniform design in examined and we show that this design is highly efficient 
in general. For the general 2 k case we show that the uniform design has a maxvmin 
property. 
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1. Introduction 

The goal of many scientific and industrial experiments is to study a process that 
depends on several qualitative factors. We will focus on the design of those 
experiments where the response is binary. If the response was quantitative and a 
linear model was appropriate, then the design of the experiment is informed by 
the extensive literature on factorial experiments. On the other hand if the factors 
were quantitative and the response was binary, the literature on optimal design 
of generalized linear models in the approximate theory setup could be used. The 
goal of our work is to initiate the optimal design theory for factorial experiments 
with binary response. 

Specific examples of experiments of the type we are interested in are available 
in different areas of application. Smith (1932) describes a bioassay for an anti- 
pneumococcus serum where the explanatory variable is doses of the serum. Mice 
infected with pneumococcus are injected with different doses of the serum and 
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the response is survival (or not) beyond seven days. Hamada and Nelder (1997) 
discussed the advantages of using a generalized linear model for discrete re- 
sponses instead of linearizing the response to obtain an approximate linear model. 
They examined an industrial experiment on windshield molding performed at an 
IIT Thompson plant that was originally reported by Martin, Parker and Zenick 
(1987). There were four factors each at two levels and the response was whether 
the part was good or not. A 2 4_1 fractional factorial design was used with 1000 
runs at each experimental condition. Other examples include a seed gemination 
experiment described in Crowder (1978), a sperm survival experiment in Myers, 
Montgomery and Vining (2002) and a designed experiment on the reproduction 
of plum trees reported by Hoblyn and Palmer (1934). 

We assume that the process under study may be adequately described by a 
generalized linear model. While the theory we develop will work for any link 
function, in examples and simulations we focus on the logit, probit, log-log, and 
complementary log-log links. The optimal designs will be obtained using the 
D-criterion that maximizes the determinant of the inverse of the asymptotic co- 
variance matrix of the estimators (the information matrix) . In order to overcome 
the difficulty posed by the dependence of the design optimality criterion on the 
unknown parameters, we use the local optimality approach of Chernoff (1953) 
where the parameters are replaced by assumed values. We refer the reader to 
the paper by Khuri, Mukherjee, Sinha and Ghosh (2006) for details of theory of 
designs for generalized linear models. 

We will assume that every factor is at two levels, a setup of particular interest 
in screening experiments, and that (for an experiment with k factors) we are 
interested in a complete 2 k experiment, i.e., the design may be supported on all 
2 k points. The model we choose may include a subset of all main effects and 
interactions. If we assume that the total number of observations is held fixed, 
then the design problem is to determine the proportion of observations allocated 
to each of the 2 k design points. It may be noted that if the response follows a 
standard linear model, then it follows from the results of Kiefer (1975) that the 
design which is uniform on the 2 k design points is universally optimal. For the 
problem restated in terms of weighing design, Rao (1971) gave the optimality 
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of the uniform design in terms of minimizing variances of each of the parameter 
estimators. It may be noted that the uniform design is an orthogonal array (Rao 
(1947)). 

In this initial study, we focus primarily on the complete 2 2 experiment where the 
response is binary. While we do not find analytic solutions for D-optimal design 
for the general 2 2 experiment, we obtain characterizations for several special 
cases. For the general 2 2 experiment we indicate how a solution may be obtained 
by Cylindrical Algebraic Decomposition (CAD). We also examine uniform designs 
and show that these are highly efficient in general. 

For the general 2 k experiment we show that the uniform design is maximin D- 
optimal design, i.e., a design that maximizes a lower bound of the D-criterion. 

In section 2 we give the preliminary setup. Results for the 2 2 experiment are 
proved in section 3. In section 4 we study robustness of the uniform design, and 
in section 5 we consider the general 2 k experiment. Some concluding remarks 
are given in section 6. Proofs are relegated to the appendix and details are also 
available under the supplementary materials. 

2. Preliminary Setup 

Consider a 2 k experiment, i.e., an experiment with k explanatory variables at 
2 levels each. Suppose rij units are allocated to the ith experimental condition 
such that ?i{ ^ 0, i = 1, . . . , 2 k , and rt\ + ■ ■ ■ + n 2 k = n. We suppose that n is 
fixed and consider the problem of determining the "optimal" n^'s. In fact, we 
write our optimality criterion in terms of the proportions: 

Pi = rii/n, i = l,...,2 k 

and determine the "optimal" p^s. (Since n^'s are integers, an optimal design 
obtained in this fashion may not be "feasible" - an issue we will not deal with, 
except to say that a feasible solution "near" an optimal solution is expected to 
be "nearly optimal"). 

Suppose 7] is the linear predictor that involves main effects and interactions which 
are assumed to be in the model. Our main focus in this initial research on 
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the topic will be the 2 2 experiment with main-effect model, in which case rj = 
00 + Pi^i + P2X2 and (3 = (/3q, /%) ■ 111 the framework of generalized linear 
models, the response Y is linked to the linear predictor by the link function g: 
E (y) = p,, rj = g{n) (McCullagh and Nelder (1989)). For a binary response, 
the commonly used link functions are logit, probit, log- log, and complimentary 
log- log links. 

The maximum likelihood estimator of j3 has an asymptotic covariance matrix 
(Khuri, Mukherjee, Sinha and Ghosh (2006)) that is the inverse of nX'WX, 

where W = diag (itfiPi, ...,w 2 kp 2 k) , Wi = {^^j / (A*i(l — Mi)) > and X is the 
"design matrix". For a main-effect 2 2 experiment, for instance, X = ((1, 1, 1, 1)', 
(1,1,-1,-1)', (1, — 1, 1, — 1)'). The matrix X'WX may be viewed as the per- 
observation information matrix. The D-optimality criterion maximizes the de- 
terminant . 

3. D-Optimal 2 2 Designs 

If k = 2, for a main-effect plan, the asymptotic information matrix is proportional 
to X'WX. It can be shown that |X'WX| can be written as (except for the 
constant 16): 

det(w, p) = G(p) = p 2 w 2 ■ P3W3 ■ PiWi + P\w\ ■ P3W3 ■ PiWi 

+ piwi ■ P2W2 ■ P4W4 + P1W1 ■ p 2 w 2 ■ P3W3 (3.1) 

where w = (wi, W2, W3, W4)' and p = (pi,P2,P3,P4:)' ■ In this section, we will 
consider the problem of maximizing G(p) over all vectors p with Pi > and 

EiPi = i- 

3.1 Analytic solutions to special cases 

It follows from Kiefer (1975) that if all the Wi's are equal then the uniform 
design (pi = P2 = P3 = Pa = 1/4) is D-optimal. If one and only one of the Wi's is 
zero, then the optimal design is uniform over the design points that correspond 
to the nonzero w^s, and if two or more w^s are zero, then G(p) = 0. From 
now on, we assume wi > 0, i = 1,2,3,4. Define L = G / \w1W2W3W4) and 
Vi = 1/wi, i = 1,2,3,4. The maximization problem (|3.1f) can be rewritten as 
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maximizing 

L(p) = ViPip-zPz + V 3 pip 2 PA + V2P1P3P4, + Vip 2 P3Pi- (3.2) 

Although the objective function (|3.2p is elegant, an analytic solution with general 
Vi > is not available. In this subsection, analytic solutions are obtained for some 
special cases. 

Theorem 3.1.1 L(p) has a unique maximum at p = (0,1/3,1/3,1/3) if and 
only if v\ > i>2 + ^3 + v 4 . 

Note that this does not correspond to a complete 2 2 experiment, rather it cor- 
responds to a design supported only on three points, which is saturated for the 
main effects plan rj = /?o + P1X1 + $2X2- For the logit link function, < wi < 0.25. 
Computations under the simulation condition that Wi iid ~ uniform(0,0.25) show 
that the chance of obtaining a saturated solution is 48%. For other link functions, 
the chances are similar. 

Lemma 3.1.1 If v\ > V2, then any solution to the maximization problem of 
must satisfy p\ < P2', if Vi = V2, then any solution must satisfy p\ = p2- 

Theorem 3.1.2 Suppose v\ > v 2 , v 3 = V4 = v, and v\ < v 2 + 2v. Then the 
solution maximizing K3. 2\) is 

1 Vi - V 2 + 4:V 1 vi — « 2 — 4v 2v , . 

Pl ~ 2 " 2(-25 + J D)' P2 ~ 2 + 2(-2S + D)> = Z^Td (3 " 3) 

with L = 2v 2 (5 2 + 4viV2 - 5D) /(— 25 + D) 3 , where 5 = v\ + v 2 - iv and D = 
y/S' 2 + 12 Vl v 2 . 

Corollary 3.1.1 Suppose V2 = V3 = = v and v\ < 3i>. Then the solution 
maximizing \3. 2\) is 

3v — v\ 2v 

Pl = 7\ , P2 = P3 = PA = 7^ 

9v — v 1 \)v — V\ 

with the maximum L = 4v 5 /(9v — v\) 2 . 

Corollary 3.1.2 Suppose v\ = v 2 = u, ^3 = ^4 = v, and u > v. Then the 
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solution maximizing h3. 2\) is 

2u — v — d u — 2v + d 

b(u — v) b[u — v) 

with the maximum L = (2u — v — d)(u — 2v + d)(u + v + d)/ [l08(ii — t>) 2 ] , where 
d = \J u 2 — uv + v 2 . 

Theorem 13.1.11 reveals that the D-optimal design is saturated if and only if 
2maxjUj > ^1 + ^2 + ^3 + va- We call it the saturation condition. In terms 
of iw's, it is 

2/ min{u;i, 102,^3, 104} > l/w\ + l/w 2 + l/w 3 + 1/104. (3.4) 
Theorem 13.1.31 examines this condition in terms of the /3's. 

Theorem 3.1.3 For the logit link, the saturation condition is true if and only if 




Figure 1: Lower boundary of the region satisfying the saturation condition 
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Figure 1 shows how the region satisfying the saturation condition changes with 
/?o- For fixed (3q, a pair (/?i, fa) satisfies the saturation condition if and only if 
the corresponding point in Figure 1 is above the curve labelled by /?o- 

For logit link, /3 , /?i , /3 2 are symmetric with respect to Wi, w<2, w^, W4. If we 
permute /0o,/3i,/?2> we only need to change the order of w\,W2,ws accordingly. 
It won't affect whether or not the saturation condition is satisfied. Therefore, we 
can get parallel results to Theorem 13. 1.31 in any order of /3o,/?i,/32. 

3.2 Exact solution using computer-aid optimization 

For given values of v\, i>2, i>3, f4, depending on the computational resources, one 
can always try to apply some numerical searching algorithm such as Nelder- 
Mead, quasi-Newton, conjugate-gradient, or simply a grid search. However, such 
solutions will not be very accurate in general. 

For general cases other than the special ones discussed above, one may use cylin- 
drical algebraic decomposition (CAD) (Fotiou et al. (2005)) to find an exact 
solution maximizing (13.2|) . For given ui , i>2 , ^3 , ^4, the CAD algorithm partitions 
the feasible domain of (L,p\,p2,pz) into a finite union of disjoint homogeneous 
cells in terms of polynomial constrains. The cell with greatest L provides us a 
solution. However, it is difficult, if not computationally infeasible, to get explicit 
formula involving general v%,V2, v%, V4. 

Besides the CAD approach, one may also use the Lagrange multipliers or the 
Karush-Kuhn- Tucker (KKT) conditions (Karush (1939), Kuhn and Tucker (1951)) 
to find all the local extrema of L, and then pick the largest one. For general 
vi,V2,V3,V4, this approach leads to two polynomial equations of order 4 each. 
No explicit formula involving general v±, «2, ^3,^4 is available. 

3.3 Analytic approximate solution 

For the optimization problem (|3.2p . we do not find analytic solutions for the 
general case. In this section, we will propose an analytic approximate solution. 
To simplify notations, write L[v\, V2, v$, V4] for max p L, given v±,V2,vs,V4. For 
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example, Theorem 13.1.21 corresponds to L[vi,V2,v,v]. Without any loss of gen- 
erality, we assume v± < V2 < V3 < V4 and v 4 < v\ + v% + f 3 . Define 

L 34 = L[v 1 ,v 2 , (v 3 + u 4 )/2, (« 3 + v 4 )/2] 

and L12, £13, Lu, L23, L24 accordingly. The strategy is to use max{Li2, L23, £34} 
to approximate max p L based on the theorem as follows. 

Theorem 3.3.4 Assume v 1 < V2 < V3 < V4 and V4 < v\ + vi + V3. Then 



We call the best p among the solutions to L12, L23, or L34 the analytic approx- 
imate solution, and denote it by p a . Then L(p a ) = max{Li2, L23, L34}. Theo- 
rem [333] provides a theoretical upper bound for the difference max p L — L(p a ). 

To see how our approximation works numerically, we randomly selected 1000 
w = (u>i, W2, W3, W4) with 0.05 < Wi < 0.25. For each randomly chosen w, 
we calculate the optimal p using CAD and denote it by p Q . We also calculate 
the analytic solution based on Theorems 13.1.11 13.1.21 or 13.3.41 and denote it by 
p*. If the saturation condition (|3.4|) is satisfied or W{ = Wj for some i 7^ j, then 
P* = Po- Otherwise, the analytic approximate solution based on Theorem 13.3.41 
is applied, and p* = p a . Then we calculate the determinant of the information 
matrix and denote it by D Q and D* for p Q and p*, respectively. In Figure 2, we 



that 96% of the relative losses are less than 0.03% and the maximum relative loss 
is about 0.085%. So the analytic approximation solution works very well. 

Since Uj's depend on the assumed values, they are in general not quantified ac- 
curately at the planning stage. The results in this section show that setting 
some ViS to be equal is not a bad strategy. The analytic approximation is also 
potentially useful for future theoretical research in this area. 



max{Li3,Li4,L 2 4} < m&x{L 12 , L 23 , L34} 





Numerical results show 
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Figure 2: Efficiency of the approximate solutions 



4. Robustness of uniform design 

If the experimenter is unable to make an informed choice of the assumed val- 
ues for local optimality, the natural design choice is the uniform design p u = 
(1/4, 1/4, 1/4, 1/4). The relative loss of efficiency of p u with respect to the true 
w = (101,^2,^3,^4) is: 

det(w,p t ) 1/3 - det(w,p n ) 1/3 1 ( v\ + v 2 + v 3 + 1 

^4 W ) = ; — ; rrv^ = 1 ~ 7 



det(w,p 4 ) 1/3 4 V L(pt) 

where = 1/wi, Pt is the optimal design under w, and L(pt) is defined in (|3.2p . 
To study the maximum loss of efficiency, we consider Rmlx = max R u (w). The 

w 

(u) 

following theorem finds the values of i?max for different values of w's. 

Theorem 4.1.5 Without any loss of generality, assume v± > v 2 > V3 > V4. 
Then 

3«\l/3 



#0) 

-* 1 TY1 SI V 



1 - f (1 + T) # Ul - V2 + U3 + 1,4 < ° - Vi - b 

1 - I x 2 1 / 3 if vi <v 2 + v 3 + v 4 



If v\ < ^2+^3+^4, the maximum loss of the uniform design is 1 — | x 2 1 / 3 ~ 0.055 
regardless of the choice of link functions. The uniform design clearly performs 



10 



ABHYUDAY MANDAL, JIE YANG AND DIBYEN MAJUMDAR 



very well. Now, let us consider the saturation condition v\ > V2 + Vs + 1)4. For 
some reasonable choices of w's, Table I4TT1 gives results for popular link functions. 
These -Rmax's are significantly larger than 0.055. Moreover, the efficiency of the 
uniform design gets worse as a decreases to zero. This is intuitive because as a 
decreases, 2u max — ^ Vi increases, worsening the performance of uniform design. 
Nevertheless, based on Theorem 13.1.11 if we know that v\ > V2 + vz + V4 the 
design of choice clearly is the saturated design (0, 1/3, 1/3, 1/3). We will discuss 
the utility of saturated designs in practical applications in section 6. It can be 
seen from Theorem 14 . 1 . 5 1 that in general the maximum loss of efficiency is 1/4. 



Table 4.1: Maximum loss of efficiency for v\ > V2 + V3 + V4 





Link functions 


Logit 
0.05 < w < 0.25 


Probit or 
(complementary) log-log 
0.05 < w < 0.65 


JLmax 


0.123 


0.196 



5. General case: 2 k experiment 

The general case of 2 k factorial is technically complicated. We have some results 
for particular cases that will be reported in future publications. In this section 
we will show that the uniform design (p\ = • • • = p 2 k = ^) has a maximin 
optimality property, i.e., it maximizes a lower bound of the D-criterion. 

Suppose there are q parameters (main effects and interactions) in the chosen 
model for the 2 k experiment, i.e. (3 is a q x 1 vector and the design matrix X is 
2 k x q. The matrix X can be extended to the 2 k x 2 k design matrix F of the "full 
model", i.e., one that includes all main effects and interactions. Note that F is 
a Hadamard matrix, i.e., F'F = 2 k I 2 k, where I 2 k is the identity matrix of order 
2 k . Moreover, we can partition F as F = (X\R) where R contains all effects that 
are not in the chosen model. Clearly, 



\F'WF\ = \X'WX\ 



R'WR- R'WX (X'WX) l X'WR 



< \X'WX\ \R'WR\ 



Hence 



OPTIMAL DESIGN FOR BINARY RESPONSE 11 

\F'WF\ 



\x'wx\ > 



\R'WR\ 



Note that \F'WF\ = 2 k m Wi Yl Pi , and \R'WR\ ^ {w M f k ~ q \BfPR\ where w M = 
maxjtoi, w 2 k}, and P = diag (p\, ...,p 2 k). It follows from Proposition 1' of 
Kiefer (1975) that the uniform design pi = ■ ■ ■ = p 2 k = ^ maximizes \R'PR\. 
Hence we obtain a lower bound to the D- criterion 

\x>wx\ > 2 * 2 ' n "" n « 



{WRI) 2 q 

This lower bound is a maximum when p\ = ■■■ = p 2 k = p-, which gives a 
maximin property of the uniform design. 

How good is the uniform design? The loss of efficiency corresponding to the 
uniform design is 



Ru{w) = l - 
Then it can be shown that 



\X'diag ( Wl /2 k ,...,w 2 k/2 k )X\ 
max\X'WX\ 



1/9 



^(tO) < 1 -T ^ 1 

2 k W M WM 

where w m = min{tt;i, ...,w 2 k} and Wo = diag (w\, ...,w 2 k) . 

For example, suppose it is known that 0.14 ^ Wi ^ 0.20. Then the uniform design 
is not less than 70% efficient, irrespective of the value of k or the X matrix (the 
model). While this bound enables us to make general statements like this, the 
results in Theorem 14.1.51 for the 2 2 case shows that this bound can be quite 
conservative. 

6. Discussion 

Locally optimal designs require assumed values of the parameters w which may 
not be readily available at the planning stage. The expression of Wi given in 
section 2 depends on the assumed values of the parameter (3 and the link func- 
tion rj. In situations where there is overdispersion, this expression (the nominal 
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variance) may not be adequate to describe variation in the model, and a more 
realistic representation for w may be 



where c% is a function of the factor levels x%, X2, ■ ■ ■ , Xk- This will make the speci- 
fication of Wi even more difficult. For the design problem, however, it can be seen 
that we need only the relative magnitudes w* = Wi/wM, which may be easier to 
specify in some applications. Note that w* take values in the interval (0, 1]. 

The overall conclusion for the 2 2 factorial experiment with main effects model 
is that, for the link functions we studied, the locally optimal designs are robust 
in the sense that the loss of efficiency due to misspecification of the assumed 
values is not large. If the (assumed) variance at one point is substantially larger 
than the others, then the D-optimal design is based on only 3 of the 4 points. 
In real world experiments, however, an experimenter would rarely feel confident 
to not allocate observations at a point based solely on assumed values, and this 
is not our recommendation for practice. However, the D-optimal design would 
still provide a useful benchmark for the efficiency of designs, and to the extent 
feasible it is wise to dedicate more resources to points that we believe have small 
variance and less resources to points with large variance. If there is no basis to 
make an informed choice of the assumed values, we can recommend the use of 
the uniform design. 

We have extensive results on the robustness of the 2 2 designs against the mis- 
specification of w, which will be reported in another publication. This paper 
consists of initial results in this area of optimal designs for two level factorial 
experiments with binary response. Our research is ongoing on extending these 
results to more general factorial and fractional factorial experiments. 
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Appendix 

1 . Proof of Theorem \3.1.1\ 

(1) If V\ > v 2 + V3 + t>4, then 

L = V±(pip 2 pz + P2P3PA) + V3(PlP2P4 + P2P3Pa) + V 2 (piP3PA + P2P3Pa) 
+ (vi -V 2 -V 3 - Vi)p 2 PzPi 
= Va (pi + Pi )p 2 P3 +V 3 (pi+p 3 )P2PA + V 2 (pi + P2 )P3P4 
+ («! -V2-V3- Va)P2P3PA 
. ( {pi +Pa) +P2 +P3\ 3 , ( {Pi +P3) +P2 +P4 X 3 



. ( (gl +P2) +P3 +P4 ^ 3 . , » 

+V 2 I I + (Wl - t>2 - U 3 - t>4jP2P3P4 (A.l) 

VA V3 V 2 , s 

= 27 27 + 27 + ~ V2 ~ V3 ~ V4 ' P2P3P4 

V2+V3+V4 . , s / P2 +P3 +P4 ^ 3 „ A 

< 27 + ^ 1 ~ V2 ~ V3 ~ VA > I 3 J t ) 

, V 2 +Vz + V A f pi +p 2 +P3 + PA V „x 

< ^ + (ui -«2- v 3 -v 4 ) I I (A.3) 

27 

By the inequality of arithmetic and geometric means, the "=" in (fA~l~|) is true if 
and only if 

Pi + P4 = P2 = P3, Pl+P3=P2= PA, and pi + P2 = P3 = PA 

which implies p\ = 0, P2 = P3 = Pa — 1/3. Note that the "=" in (|A.2[) is true if 
P2 = P3 = Pa, an d the "=" in (|A.3p is true if pi = 0. After all, L = ui/27 if and 
only if pi = 0, p 2 = P3 = P4 = 1/3. 

(2) If p = (0, 1/3, 1/3, 1/3) maximizes (|3.2p . we claim that v\ > v 2 + v 3 + U4. 
Otherwise, if ui < ^2+^3+^4) the solution p e = (e, (1 — e)/3, (1 — e)/3, (1 — e)/3) 
will be better than p for small enough e > 0. It can be shown that 

L (p e ) > L(p) if and only if 2>d\ > e(3 + 6di — 2e — 3die) 



where d\ = (v 2 + V3 + t> 4 — V\)/v\ > 0. 



□ 
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2. Proof of Theorem \4.1.5 



To maximize R u (w) is equivalent to minimize 

V\ + V 2 + f 3 + V 4 



Q(vi,v 2 ,v 3 ,v 4 ) = 

V1P2P3P4 + V2P1P3P4 + V3P1P2P4 + V4P1P2P3 

where (pi,p 2 ,P3,P4) is the optimal allocation for the given (v±, v 2 , v 3 , v 4 ). Note 
that v 1 > v 2 > V3 > V4 implies that p\ < p 2 < p 3 < p 4 . 

(1) If v\ > v 2 + i>3 + «4, then p\ = and p 2 = p% = p 4 = 1/3 which gives 

R u W = l~- 4 [l + 

Suppose < a < Vi < b for i = 1, . . . ,4, then v ± > v 2 + v 3 + v 4 implies b > 3a 



and 



Q(vi,v 2 ,v 3 ,v 4: ) = 27 



1 + 



""2 + ^3+ V4 
Vl 





3a 


> 27 


1 + T 



The minimum of Q is attained at v\ = b and V2 = V3 = v 4 = a. So R 



(«) 



l-|(l + 3o/6) 



1/3 



Specifically, if vi = V2 + V3 + v 4 , Q(v%, v 2 , V3, v 4 ) = 54 and R u (w) = 1 — | X 2 1 / 3 . 

(A.4) 

(2) If vi < v 2 + v 3 + i> 4 , then pi > 0. Let 5 > be small enough so that 
p'i =Pi-5(l-p 1 ) > 0. LetpJ = {l + 5)pi, i = 2,3,4. Then p[ +p' 2 +p' 3 +p' 4 = 1. 
It can be verified that for any v[ > v\, 



Q{v[,v 2 ,V3,v 4 ) < - - - - 



^ + 1>2 + «3 + ^4 



1 — rT < Q{V 1 ,V 2 ,V3,V 4 ) 



V1P2P3P4 + v 2P'lP3P4 + v 3P'lP 2 P4 + V4P'iP' 2 P'3 

for small enough 5 > 0. On the other hand, it can be verified that 

lim Q(vi,v 2 ,V3,V4) = Q(v 2 + v 3 + v 4 , v 2 , v 3 , v 4 ) = 54 

flt(«2+f3+f4) 



regardless of the values of t>2,t>3,U4. So by ()A.4j) the maximum of Ru(w) is 
1 — I x 2 1 / 3 under the restriction vi < v 2 -\- v 3 + v 4 . □ 
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